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(54) Method and system for multimedia conferencing 



(57) A multipoint control unit (1 04) is provided which 
allows for dynamic codec selection. According to one 
embodiment, the MCU (104) causes endpoints (102, 
1 06) to renegotiate their codec selections if a most-com- 



mon ly available codec is not being used, upon entry of 
new parties to a teleconference. Alternatively, the codec 
renegotiation may be performed each time a user 
speaks, to optimize for maximum transmission quality 
or for minimizing transcoding. 
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Description 

BACKGROUND OF THE INVENTION 

[0001] The present invention relates to telecommuni- 
cations systems and, particularly, to an improved sys- 
tem and method for multimedia conferencing. 
[0002] The ITU-T (International Telecommunications 
Union Telecommunications Sector) Recommendation 
H.323 defines a set of protocols for communicating us- 
ing audio, video and data over packet-switched net- 
works. To accommodate multipoint conferences (i.e., 
those involving three or more parties), the Recommen- 
dation H.323 defines a multipoint control unit (MCU) to 
coordinate the conferencing. In particular, the MCU is 
required by the Recommendation H.323 to include a 
multipoint controller (MC), which handles H.245 signal- 
ing. In addition, the MCU may include one or more 
multipoint processors (MP), which mix and process the 
data streams. 

[0003] The MPs may also provide conversion, or 
transcoding, between different codecs. However, typical 
MPs transmit at the highest quality codec each user will 
support, whether or not it is necessary. For example, if 
someone with a high quality G.711 codec is talking, us- 
ing the "best codec" method allows everyone to receive 
the voice with the highest possible quality from the co- 
dec. However, if someone with a lower quality codec (e. 
g., G.723) is speaking, their voice is distributed to the 
G.711 users with G.711, which is wasteful. 
[0004] This process is illustrated schematically by 
way of an example in Table 1 and FIG. 1 A and 1B. In 
the example shown in Table 1 , User A has GSM, G.723, 
and G.711 capabilities; User BhasG.711 andG.723ca- 
pabilities; User C has G.723 capabilities; and User D 
has GSM capabilities. 



Table 1 



User A 


User B 


UserC 


User D 


GSM 


G.711 


G.723 


GSM 


G.711 


G.723 






G.723 









[0005] As shown in FIG. 1 A, if User A communicates 
in a two-party conference with User B, G.711 will be 
used, if possible. If not, then the G.723 codecs will be 
used. Then, suppose User B calls User C and confer- 
ences in User C using the conference feature. The co- 
dec choice is negotiated and the MCU 103 is inserted 
into the media stream to provide transcoding. As shown 
in FIG. 1 B, the MCU 1 03 communicates with User A and 
User B using G.711, and with User C using G.723. If a 
User D having only GSM is added to the conference, 
then MCU 103 will communicate with the User D using 
only GSM. 

[0006] The amount of transcoding the MCU 103 must 



do depends upon which party is talking. When User A 
talks, User B receives the signal as is, and User C and 
User D require transcoding. When User B talks, User A 
receives the signal as is, and User C and User 0 require 
s transcoding. When User C talks, Users A B and D re- 
quire transcoding. When User D talks, Users A, B, and 
C require transcoding. 

[0007] The prior art thus is disadvantageous in that 
the MCU is required to perform transcoding which may 
10 be sub-optimal or even unnecessary. As such, the prior 
art MCUs can waste processing resources. 

SUMMARY OF THE INVENTION 

is [0008] These and other drawbacks in the prior art can 
be addressed by a multipoint control unit (MCU) embod- 
ying the present invention. According to one implemen- 
tation, the MCU determines an optimal codec, for exam- 
ple, based on a highest quality "most common codec" 

20 among parties to a multipoint conference. Alternatively, 
the optimal codec may be chosen to minimize transcod- 
ing. The MCU instructs any parties not using that codec 
to renegotiate their connections with the MCU to employ 
that codec. The determination is made as each party is 

25 added to the multipoint conference. 

[0009] According to another embodiment, the codec 
optimization is made every time a different party talks. 
As each party is identified, the MCU issues commands 
to renegotiate the connections with the endpoints. 

30 Again, the codec may be chosen to maximize quality or 
to minimize transcoding. 

[0010] According to another embodiment, a particular 
party is chosen as having a default codec. That party is 
chosen as being allowed its highest quality codec, with 
35 other parties receiving at their highest qualities possible. 
However, when the other parties transmit, they send 
with a lower quality codec to preserve bandwidth. 

BRIEF DESCRIPTION OF THE DRAWINGS 

40 

[0011] A better understanding of the invention is ob- 
tained when the following detailed description is consid- 
ered in conjunction with the following drawings in which: 

45 FIG. 1 A and FIG. 1 B illustrate operation of an MCU 
according to the prior art; 

FIG. 2 illustrates a telecommunications network ac- 
cording to an embodiment of the invention; 
FIG. 3 illustrates a multipoint control unit according 
so to an embodiment of the invention; 

FIG. 4 is a flowchart illustrating operation of an em- 
bodiment of the invention; 
FIG. 5 is a diagram schematically illustrating oper- 
ation of an exemplary implementation of the inven- 
ts tion; 

FIG. 6 is a flowchart illustrating operation of an em- 
bodiment of the invention; 
FIG. 7 is a diagram schematically illustrating oper- 
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ation of an exemplary implementation of the inven- 
tion; 

FIG. 8 is a flowchart illustrating operation of an em- 
bodiment of the invention; 

FIG. 9 is a diagram schematically illustrating oper- & 
ation of an exemplary implementation of the inven- 
tion; and 

FIG. 1 0 is a flowchart illustrating operation of anoth- 
er embodiment of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[0012] FIGs. 2-10 illustrate an improved multipoint 
conferencing system and method. An embodiment of 
the present invention provides for more optimal selec- 
tion of codecs in a multipoint control unit. Optimal selec- 
tion of codecs may be based on minimizing bandwidth 
use, minimizing transcoding, or maximizing transmis- 
sion quality. Moreover, such optimization may occur ei- 
ther as new users are added to the multipoint confer- 
ence, or as particular users begin speaking. Finally, one 
or more users may be assigned a fixed higher or lower 
quality codec throughout the conference. 
[001 3] Turning now to the drawings, and with partic- 
ular attention to FIG. 2, a diagram illustrating an exem- 
plary H.323 telecommunications system 100 according 
to an embodiment of the present invention is shown. It 
is noted that, while described specifically in the context 
of voice packets, the present invention encompasses 
the use of any multimedia information, such as video, 
data, voice, or any combinations thereof. Moreover, an 
exemplary generic H.323 system is the Siemens Hi- 
Net™ RC 3000 system available from Siemens. 
[001 4] The telecommunications system 1 00 includes 
a local area network (LAN) or packet network 101 . Cou- 
pled to the LAN 1 01 may be a variety of H.323 terminals 
102a, 102b, 102c, 102d, a multipoint control unit (MCU) 
104 according to the present invention, an H.323 gate- 
way 106, an H.323 gatekeeper 108, a LAN server 112 
and a plurality of other devices such as personal com- 
puters (not shown). The H.323 terminals 102a, 102b, 
102c, 102d and H.323 gateway 106 and H.323 gate- 
keeper 108 are in compliance with the H.323 Recom- 
mendation. H.323 terminals 102 and H.323 gateway 
106 are each 'endpoints' as may be discussed below. 
The H. 323 endpoints support H.245 control signaling for 
negotiation of media channel usage, Q.931 (H.225.0) 
for call signaling and call setup, H.225.0 Registration, 
Admission, and Status (RAS), and RTP/RTCP for se- 
quencing audio and video packets. The H.323 end- 
points may further implement audio and video codecs, 
T.120 data conferencing protocols and MCU capabili- 
ties. Further details concerning the H.323 Recommen- 
dation may be obtained from the International Telecom- 
munications Union; the H.323 Recommendation is 
hereby incorporated by reference in its entirety as if fully 
set forth herein. 

[001 5] The MCU 1 04 includes a Transcoding Control 



Unit (TCU) 105. As shown in FIG. 3, the TCU 105 is 
coupled to a Multipoint Processor (MP) 110 and a 
Multipoint Controller (MC) 112. The MP 110 performs 
the actual media signal processing, i.e., switching, and 
the tike. The MC 112 handles H.245 capability negotia- 
tions to determine existence of a common codec. As will 
be explained in greater detail below, the TCU 105 pro- 
vides for more optimal selection of the codec which is 
to be used. The TCU 1 05 is programmed with the codec 
information for each of the users. When the conference 
is set up, the TCU 1 05 determines what common codec, 
if any, each of the parties possess, and causes a sign- 
aling message, RenegotiateCodec, to be relayed to par- 
ties to the conference that they will have to use the com- 
mon codec. If they are not currently using the common 
codec, they will need to renegotiate this portion of the 
call set-up with the MCU 104. It is noted that, while 
shown as discrete units, the MC 112, MP 110 and TCU 
105 may be embodied as one or more integrated proc- 
essors. Thus, the figures are exemplary only. 
[0016] Turning now to FIG. 4, a flowchart illustrating 
general operation of an embodiment of one aspect of 
the present invention is shown. First, in a step 400, the 
MCU 104 receives codec setup information concerning 
each of the parties on the network. The TCU 1 05 stores 
this information in a database (not shown), in a step 402. 
During a multipoint conference, the MCU 104 identifies 
the parties and accesses the database for their codec 
information, in a step 404. If predetermined optimization 
criteria are met through use of the default codecs, then 
in a step 408, the connections (in step 412) are negoti- 
ated using the defaults. As will be discussed in greater 
detail below, the optimization criteria may include mini- 
mizing transcoding, maximizing quality, or other desired 
criteria. Turning back to FIG. 4, if the optimization criteria 
are not met, then in a step 410, the MCU 109 and par- 
ticularly, the TCU 105 instructs the codecs of all con- 
cerned parties that the connections (in step 412) are to 
be renegotiated for optimal codec usage. 
[001 7] A first embodiment of the invention is illustrat- 
ed schematically by way of example with reference to 
FIG. 5. In the example of FIG. 5, the optimal coding 
choice to minimize transcoding is made every time a 
new user is added. In this example, the users have the 
coding capabilities as set forth in Table 1 . Thus, User A 
has GSM, G.723, and G.711 capabilities; User B has G. 
711 and G.723 capabilities; User C has G.723 capabil- 
ities; and User D has GSM capabilities. 
[0018] If User Acommunicates with User B, G.711 will 
be used, if possible. If not, then the G.723 codecs will 
be used. Then, suppose User B calls User C and con- 
ferences in User C using the conference feature. The 
codec choice is then renegotiated on the fly as shown 
in FIG. 5. That is, User A will now communicate with the 
MCU 104 using G.723, User B will communicate with 
MCU 104 using G.723, and User C will communicate 
with the MCU 104 using G.723. Thus, in the example of 
FIG. 5, the User A and the User B will need to renego- 



75 



20 



25 



30 



35 



40 



45 



50 



3 



5 



EP 1 077 565 A1 



6 



tiate (from G.711 to G.723). Once this is done, no trans- 
coding is needed because all the codecs are G.723. 
[0019] Next, if a User D is added to the conference, 
the MCU 104 will communicate with it using GSM, since 
that is the only codec supported by the User D. If GSM s 
is preferred by the MCU, then User A could be required 
to renegotiate the connection using GSM. 
[0020] As can be appreciated, depending on which 
party is talking, the MCU 1 04 may have little or no trans- 
coding to do. When User A talks using G.723 coding, 
User B and C receive the signal as is, and User D re- 
quires transcoding. When User B talks, Users A and C 
receive the signal as is, and User D requires transcod- 
ing. When User C talks, Users A and B receive the signal 
as is, and User D requires transcoding. When User D 
talks, Users A, B, and C all require transcoding. Never- 
theless, the amount of transcoding needed is less than 
in the case of FIG. 1A-1B. Moreover, in this embodi- 
ment, when each party is added, the optimal coding 
choice to minimize transcoding is made. For example, 
if Users E, F, and G were added, all with only GSM ca- 
pabilities, then User A would be switched to GSM, since 
a majority of the users support GSM rather than G.723. 
[0021] A flowchart illustrating operation of this embod- 
iment is shown in greater detail with reference to FIG. 
6. In a step 602, the MCU 104 and, in particular, the TCU 
105 receives information concerning endpoints on the 
network and their coding capabilities and stores them in 
a memory or database (not shown). In a step 604, the 
MCU 104 and, particularly, the MC 112, receives the 
multipoint conference call set-up commands, including 
identification of the users and their requested codecs. 
In a step 606, the TCU 105 receives the identification 
and codec requests, and accesses the user-codec da- 
tabase to organize the users by type of codec and de- 
termine the most common codec. In certain instances, 
a quality floor" or threshold may also be provided. Next, 
in a step 608, the TCU 1 05 determines whether the most 
common codec is in use or has been requested by all 
the users to the conference. If so, then the conference 
will proceed, in a step 610. If not, then in a step 612, the 
TCU 1 05 will cause the MC 1 1 2 to issue a Renegotiate- 
Codec command to the relevant users. The Renegoti- 
ateCodec command may include, as a parameter, an 
identification of the particular codec which is to be used. 
In a step 614, the relevant user sends a call setup com- 
mand which is received by the MCU 104's MC 112. The 
MC 11 2 recognizes the call setup command as pertain- 
ing to the particular conference and, in a step 616 un- 
dertakes the appropriate H.323 call control and signal- 
ing commands to set up the new connection using the 
new codec. Once the new connection has been estab- 
lished, in a step 618, the old connection is dropped. Fi- 
nally, in a step 620, the conference proceeds using the 
new codec selections. 

[0022] The embodiment described above modifies 
the coding choice as parties are added and dropped 
from the conference. In a second embodiment, howev- 



er, the coding choice is modified every time a different 
party talks. Thus, every time a new party talks, that party 
is identified as the dominant party by the MCU 104 and 
the MCU 104 issues the proper signals to renegotiate 
the rates with the endpoints. For example, turning to 
FIG. 7A, the example of Table 1 is again used. If User 
A is talking in a conference involving Users A, B, C, and 
D, then the connections should appear as in FIG. 7 A, if 
the quality of the connection is to be maximized. That 
is, the Users A and B communicate with the MCU 104 
using G.711; the User C communicates with the MCU 
104 using G.723; and the User 0 communicates with 
the MCU 104 using GSM. Alternatively, if transcoding is 
to be minimized, then the connections will be as shown 
in FIG. 7B. Thus, Users A, B, and C all communicate 
with the MCU using G.723; and User D communicates 
using GSM. 

[0023] A flowchart illustrating this embodiment of the 
invention is shown in FIG. 8. In a step 802, the multipoint 
conference is set up via the MCU 104. In a step 804, the 
TCU 105 receives the user identification and codec re- 
quests, and accesses the user-codec database. In a 
step 806, the TCU 105 detects a new user talking. In 
response, in a step 808, the TCU 105 accesses the da- 
tabase to determine whether codec usage is optimized. 
As discussed above, codec usage may be optimized to 
maximize quality of minimize transcoding. Next, in a 
step 810, the TCU 105 determines whether any of the 
users must renegotiate their codecs for optimization. If 
not, then in a step B24 ? the conference proceeds. How- 
ever, if they do, then in a step 812, the TCU 105 sends 
an identification of the user to the MC 112. The MC 112 
will issue a RenegotiateCodec command to the relevant 
users in a step 814. The RenegotiateCodec command 
may include, as a parameter, an identification of the par- 
ticular codec which is to be used. In a step 816, the rel- 
evant user sends a call setup command which is re- 
ceived by the MCU 104's MC 112. The MC 112 recog- 
nizes the call setup command as pertaining to the par- 
ticular conference and, in a step 818, undertakes the 
appropriate H.323 call control and signaling commands 
to set up the new connection using the new codec. Once 
the new connection has been established, in a step 820, 
the old connection is dropped. Finally, in a step 822, the 
conference proceeds using the new codec selections, 
until a new user talks and the system cycles back to step 
806. 

[0024] In another embodiment of the invention, the 
MCU 104 is configured to receive an identification of a 
particular user as a primary user; all others are identified 
as secondary. For example, in a teacher/lecturer envi- 
ronment, it may be desirable to provide the teacher with 
the highest quality codec when speaking, but the stu- 
dents with a tower quality one when questioning. In this 
case, the MCU 104 will cause the connection from the 
primary user and to the secondary users to be the high- 
est quality possible. The connection from the secondary 
users will be at a lower quality, to preserve system band- 
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width. For example, assume that user capabilities are 
as defined in Table 1 . If User A is chosen as the primary 
user, then its connection to the MCU 104 will be made 
using G.71 1 . As shown in FIG. 9, the MCU 104 will com- 
municate to the Users B, C and D using their highest 
quality codecs: G.711, G.723, and GSM, respectively. 
However, the Users B, C, and D will communicate to the 
MCU using a lower quality codec, if supported. Thus, 
User B will communicate to the MCU with G.723. 
This process is illustrated in greater detail with reference 
to FIG. 10. As shown, in a step 950, the TCU 105 re- 
ceives an identification of a primary and one or more 
secondary users. In a step 952, the multipoint confer- 
ence among those users begins. In a step 954, the sys- 
tem determines whether the primary user is speaking. 
If so, then in a step 960, the highest quality coding is 
used. If that coding is not currently being employed, then 
the connections are switched, in a manner similar to that 
described above. However, if in step 956 a secondary 
user was speaking, then in a step 958, lower quality co- 
decs are used. If such coding is not currently being em- 
ployed, then the coding is changed in a manner similar 
to that described above. 



Claims 



ing among two or more parties to a multipoint con- 
ference, the method characterized by: 

optimizing an amount of transcoding required 
to be performed by said transcoding to one or more 
s predetermined criteria. 

7. A method according to Claim 6, wherein said opti- 
mizing optimizes upon entry of new parties to a 
multipoint conference. 

10 

8. A method according to Claim 6, wherein said opti- 
mizing optimizes upon a new party communication. 

9. A method according to Claim 6, said predetermined 
is criteria comprising minimizing an amount of trans- 
coding required to be performed by said transcod- 
ing. 

10. A method according to Claim 6, said predetermined 
20 criteria comprising maximizing a coding quality. 

11. A method according to Claim 6, said predetermined 
criteria comprising assigning one or more of said 
parties to a predetermined codec and others of said 

25 parties to a different codec. 



10 

8. 



1. A multipoint control unit (MCU), including a 
multipoint controller (11 2) configured to perform call 
signaling between said MCU (104) and a plurality 30 
of endpoints (102, 106) and a multipoint processor 
(110) configured to perform transcoding between 
codecs of different types, characterized by: 

a transcoding control unit (105) configured to 
direct said multipoint controller (112) to signal at 35 
least one of said plurality of endpoints (1 02, 1 06) to 
communicate using a predetermined codec so as 
to optimize said transcoding to one or more prede- 
termined criteria. 

40 

2. An MCU according to Claim 1, wherein said trans- 
coding control unit (1 05) directs said multipoint con- 
troller (112) upon entry of an endpoint into a 
multipoint conference. 

45 

3. An MCU according to Claim 1, wherein said trans- 
coding control unit (1 05) directs said multipoint con- 
troller (112) when an endpoint begins communica- 
tion. 

so 

4. An MCU according to Claim 1 , wherein said criteria 
are to maximize quality. 

5. An MCU according to Claim 1 , wherein said criteria 
are to minimize transcoding required by said ss 
multipoint processor (110). 

6. A method for teleconferencing, including transcod- 
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